Parsing hungarian sentences in order to determine their prosodic structures in a multilingual TTS system

نویسنده

  • Ilona Koutny
چکیده

Naturally sounding synthesized speech requires proper prosodic structure. The unequivocal relation between syntax and prosody is contestable, but for lack of other information on discourse structure, we have to rely on syntactic structure in order to determine some prosodic features. This work – based on basic research results in Hungarian linguistics – started with a preliminary parser for simple Hungarian sentences. It tries to establish phrase boundaries with possible breaks, and to determine the focus as well as other stressed and unstressed elements The current goal is to automatically predict the above prosodic features for complex sentences. The steps of the prosody module are: pre-processing, morphological analysis, phrase parsing, sentence parsing, search for the focus, stress and break distribution. The prosodic predictor is implemented in the MULTIVOX multilingual text-to-speech system [9].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

طراحی و ارزیابی یک مدل بازسازی گفتار به روش هم‌گذاری واحدهای حساس به بافت نوایی

This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. Thesyllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...

متن کامل

Corpus-Based Unit Selection TTS for Hungarian

This paper gives an overview of the design and development of an experimental restricted domain corpus-based unit selection text-tospeech (TTS) system for Hungarian. The experimental system generates weather forecasts in Hungarian. 5260 sentences were recorded creating a speech corpus containing 11 hours of continuous speech. A Hungarian speech recognizer was applied to label speech sound bound...

متن کامل

Tree-based modeling of prosodic phrasing and segmental duration for Korean TTS systems

This study describes the tree-based modeling of prosodic phrasing, pause duration between phrases and segmental duration for Korean TTS systems. We collected 400 sentences from various genres and built a corresponding speech corpus uttered by a professional female announcer. The phonemic and prosodic boundaries were manually marked on the recorded speech, and morphological analysis, grapheme-to...

متن کامل

From multilingual to polyglot speech synthesis

This paper proposes a distinction between existing multilingual synthesis systems and mixed-lingual or polyglot synthesis systems. The latter should be capable of synthesising with the same voice utterances which contain foreign language words or word groups. As a first step towards polyglot synthetic speech, the design and realisation of a 4-lingual single-speaker diphone inventory is detailed...

متن کامل

A New Prosodic Phrasing Model for Chinese TTS Systems

This paper proposes a new prosodic phrasing model for Chinese text-tospeech systems. First, in contrast to the commonly used CART techniques, we propose a new inductive learning algorithm based on the extension matrix theory. Second, we collected 559 sentences (of approximately 78 min length) from news programs and built a corresponding speech corpus uttered by a professional male announcer. Th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999